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The development and progression of lung adenocarcinoma, one of the most common cancers, is driven by the interplay 
of genetic and epigenetic changes and the role of chromatin structure in malignant transformation remains poorly 
understood. We used systematic nucleosome distribution and chromatin accessibility microarray mapping platforms to 
analyze the genome-wide chromatin structure from normal tissues and from primary lung adenocarcinoma of different 
grades and stages. We identified chromatin-based patterns across different patients with lung adenocarcinoma of 
different cancer grade and stage. Low-grade cancers had nucleosome distributions very different compared with the 
corresponding normal tissue but had nearly identical chromatin accessibility. Conversely, nucleosome distributions of 
high-grade cancers showed few differences. Substantial disruptions in chromosomal accessibility were seen in a patient 
with a high-grade and high-stage tumor. These data imply that chromatin structure changes during the progression 
of lung adenocarcinoma. We have therefore developed a model in which low-grade lung adenocarcinomas are linked 
to changes in nucleosome distributions, whereas higher-grade tumors are linked to large-scale chromosomal changes. 
These results provide a foundation for the development of a comprehensive framework linking the general and locus- 
specific roles of chromatin structure to lung cancer progression. We propose that this strategy has the potential to 
identify a new class of chromatin-based diagnostic, prognostic and therapeutic markers in cancer progression. 



Introduction 

Since early in the last century, investigators have known that 
chromosome alterations can lead to cancer. 1 In fact, a hallmark 
of the transformed phenotype is altered chromosome structure, 
and important studies have identified many chromosomal aber- 
rations used to mark cancer type. 2 Studies at the molecular level 
have shown that these transformation events are driven by the 
interplay of genetic and epigenetic changes. 3 ' 4 Specifically, large 
chromosomal changes have been observed in different cancers, 
and susceptibility of gene loci, losses and gains and translocations 
have been identified at specific chromosomes. 5 " 10 

The progression of lung adenocarcinoma (LAC) can be classi- 
fied by histopathological grade and tumor stage. Tumors of differ- 
ent grades can be distinguished by their gene expression profiles, 
and gene expression changes are thought to be driven by tumor 
grade. 1112 Studies have also revealed a correlation between tumor 
stage and mutations at specific genes. 13 Beyond the analysis of 
specific gene loci in the context of tumor grade and stage, docu- 
mentation of the genetic or epigenetic basis for the progression of 
cancer is insufficient. As chromosomal aberrations are well docu- 
mented in nearly all cancers, the currently limited information 
on the role of chromatin structure in the progression of LAC is 
surprising. Moreover, although numerous studies have addressed 
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chromosomal aberrations in cancer, broad assessment of the 
underlying structure of chromatin has been understudied, and its 
role in malignant transformation remains poorly characterized. 

Understanding the functional organization of the genome 
remains one of the biggest challenges in biology. Eukaryotic 
genomes consist of DNA that is packaged together with histone 
proteins into chromatin. The basic subunit of chromatin is the 
nucleosome, which is composed of approximately 150 base pairs 
of DNA wrapped 1.65 times around a histone octamer. 14 The 
distribution of nucleosomes is controlled by a combination of 
chromatin regulatory complexes and features intrinsic to DNA 
sequence. This organization results in architectures that facilitate 
or impede DNA-binding interactions required for nuclear pro- 
cesses, such as transcription, replication, recombination, repair 
and transposition. 15 The higher-order structure of chromatin is 
not well characterized despite its central importance to myriad 
genomic processes. 16 Knowledge of nucleosome distribution and 
chromatin accessibility provides a critically important point of 
reference for interpreting epigenetic modification and high-order 
structures in diseases such as cancer. 17 " 19 

Systematic analysis of chromatin structure has provided 
insights into the mechanism behind chromosome alterations. A 
classic method for analysis of chromatin structure first described 
more than 25 years ago uses nuclease sensitivity as a probe of 
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highly diffusible molecules. 20 This assay remains a powerful 
method for measuring changes in higher-order chromatin struc- 
ture and has been enhanced by use in combination with modern 
genomic tools. Recent advances in microarray technology have 
facilitated application of this classic assay to scales at multiple 
levels of resolution across the entire genome, where nuclease was 
used as a probe for determining inaccessible and accessible regions 
of the genome. We have used a systematic analysis of chromatin 
structural information for lung cancer progression to seek a better 
understanding of cancer etiology. 19 ' 21 ' 22 

Our work indicates that there is an association between chro- 
matin structure and the progression of cancer. We have shown 
this link through the analysis of chromatin structure of LAC 
patients of different grades and stages at two scales: nucleosome 
distribution and chromatin accessibility. Furthermore, we have 
been able to identify discrete chromatin-based patterns that are 
consistent with different grades of cancer. 

Results 

Identification of the nucleosome distribution changes in lung 
adenocarcinoma. To investigate the role of chromatin structure 
in the progression of LAC, we sought to determine if changes 
in nucleosome distribution were a feature of LAC. We used a 
microarray-based assay to analyze primary LAC tumors and their 
corresponding normal tissue from multiple patients. To analyze 
nucleosome distribution, we isolated mononucleosomally pro- 
tected DNA from MNase-digested chromatin of both tumor and 
normal tissue from each patient. Using microarrays containing 
886 transcription start sites of cancer and immunity-related genes 
(Fig. 1A), we were able to detect changes in nucleosome distribu- 
tion between tumor and normal, for each patient. We devised a 
systematic and unbiased statistical method (using a sliding win- 
dow t-test) for comparing differences between the normal and 
tumor tissue for each patient. 

Low-grade tumor samples show many changes in nucleo- 
some distribution. To assess the role nucleosome distribution 
plays in cancer progression, we clustered the patient samples on 
the basis of histopathological grade, and found that low-grade 
tumors showed many changes in nucleosome distribution. We 
defined changes using a sliding window t-test across the 2,000 
bp surrounding the transcription start site of each of 886 genes, 
and if the overall t-test mean was above a significance threshold, 
the nucleosome distribution at that gene was considered signifi- 
cantly changed. The nucleosome distribution in the tumor and 
corresponding normal tissue differed at an average of 446 of 886 
(50%) genes in the grade 1 samples, 57 (6%) genes in the grade 2 
samples and 158 (18%) genes in the grade 3 samples (Fig. IB). We 
were particularly struck by the number of differences in nucleo- 
some distribution seen in the grade 1 patients: #1357 (633 genes) 
and #4137 (490 genes). In addition, the differences between nor- 
mal vs. tumor at each probe for all genes are shown as boxplots for 
two grade 1 patients (#1357 and #4137) and two grade 3 patients 
(#873 and #386), showing that the range of differences in the 
grade 1 patients is much broader (Fig. 1C). Correlation plots are 
shown between normal data and normal data, grade 1 data and 



grade 3 data (Fig. ID, from left to right). The Spearman's rank 
correlation coefficient is shown in each of the plots; the normal- 
normal and grade 3 -normal coefficients are both -0.9, but the 
grade 1-normal coefficient is lower, at 0.78, consistent with the 
many changes in nucleosome distribution for grade 1 tumors. 

The nucleosome distribution plots for the ATM gene, a highly 
mutated gene in LAC, 13 are shown in plots of normal compared 
with normal, grade 1 and grade 3 data (Fig. IE, from left to 
right). These plots show that normal lung epithelial tissue from 
a patient has a similar nucleosome distribution to both a normal 
tissue from a different patient and to a high-grade LAC tumor, 
but a normal tissue compared to a grade 1 LAC shows multiple 
changes, including a complete loss of a nucleosome. Altogether, 
these results suggest that the molecular pathology of lower grade 
LAC is defined by nucleosome distribution change at specific 
gene loci. 

Nucleosome distribution changes in low-grade tumors at 
LAC-specific gene loci. We next wanted to see if nucleosome 
distribution changes in grade 1 tumors occurred at genes previ- 
ously reported in the literature as being implicated in LAC, and 
if the changes were consistent between patients with tumors of 
the same grade. In order to visualize nucleosome distribution 
changes, we have shown nucleosome distribution plots for five 
genes selected from the 446 genes showing changes grade 1 LAC 
patients, which are reported as being cancer-related, two of which 
are specific to LAC carcinogenesis: ATM, MYC, RHOC, ITGA4 
and MLL5 (Fig. 2A— E). The grade 1 tumor samples (red lines) 
show changes in nucleosome distribution compared with normal 
samples (black lines) for all genes here, whereas grade 3 tumor 
samples do not. In each case, the gene loci that showed the great- 
est differences in the grade 1 patients shared an overall depletion 
of nucleosomes, suggesting a common mechanism of chromatin 
regulation in LAC. 

Nucleosome distribution changes are consistent between 
patients. Importantly, changes in nucleosome distribution 
for grade 1 samples are consistent between different patients 
(Fig. 2A-E). We were able to identify patterns of nucleosome- 
distribution changes consistent between and unique to the grade 
1 tumors (Fig. 2A-E, "grade 1" column). In a majority of cases 
we found that nucleosome distribution changes at particular 
genes in grade 1 samples were similar. Grade 2 and grade 3 tumor 
samples rarely deviated from the nucleosome distribution pat- 
tern seen in normal tissue (Fig. 2A-E, "grade 3" column). We 
compared the loci from grade 1 patients, and found that 168 of 
the 446 genes with altered nucleosome distribution were shared 
in common between patients (Fig. 2F). Gene ontology analysis 
showed that these shared genes were enriched for the regulation 
of the phosphoinositide 3-kinase (PI3K) cascade. 

Identification of the chromatin accessibility changes in 
LAC. To investigate high-order chromatin structure in the pro- 
gression of LAC, we determined if global chromosomal archi- 
tecture was consistent with the trend we saw with nucleosome 
distribution experiments. Using a novel chromatin accessibility 
assay, where we isolated inaccessible and accessible fractions of 
the genome (Fig. 3A) from the same material used for the nucleo- 
some distribution experiments, we were able to map changes in 
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Figure 1. Low-grade tumor samples show many changes in nucleosome distribution. (A) A flowchart of the general method used to determine 
nucleosome distribution from primary lung tissue. Snap-frozen tissue from clinically resected lung adenocarcinomas of different grades and matched, 
normal lung epithelium were digested with a titration of MNase and resolved on an agarose gel. Mononucleosomes were gel purified and labeled 
with Cy3, and bare genomic DNA was extracted from the same tissue labeled with Cy5. Samples were hybridized to tiling transcription start site (TSS) 
microarrays, and the data plotted in R. (B) Percentage of genes out of 866 with the greatest nucleosome-distribution differences between normal and 
tumor tissue for each patient, determined by t-test. (C) Box plot of the differences between normal and tumor tissue for two grade 1 patients, #1357 
and #4137, and two grade 3 patients, #873 and #386. (D) Scatterplot with correlation line and R-value for normal data plotted against (from left to 
right) normal data, grade 1 tumor data and grade 3 tumor data. (E) Read-out from microarray of the ATM gene for normal tissue vs. (from left to right) 
normal tissue (black) plotted against normal tissue, grade 1 tumor tissue and grade 3 tumor tissue (red lines). The x-axis represents a 2-kb range of 
genomic position centered on a TSS. Inferred nucleosomes are represented graphically (black ovals represent normal tissues; red ovals, tumor tissues). 
The y-axis is the log ratio of nucleosomal to bare genomic signal at each probe on the microarray. 
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chromatin accessibility. A 1 Mb non-overlapping sliding window 
t-test revealed regions across the genome in which each tumor 
sample and its corresponding normal tissue differed. The genome 
included approximately 1,660 nonoverlapping 1 Mb regions, in 
which resolution of the microarray (12.5 kb) and the number of 
probes within each window were taken into account. When the 
regions that changed were clustered according to cancer grade, 
we did not see the same trend as in the nucleosome-distribution 
experiments (Fig. 3B). Instead, we found that no tumor samples 
showed many regions of change in chromatin accessibility, except 
for a single grade 3 sample, #873. Clustering the patient samples 
by stage revealed that patient #873, who had the highest percent- 
age of regions changed, was also at the highest LAC stage, HB. 

The chromatin-accessibility plots are consistent with sta- 
tistical analysis, in which the tumor samples are indistinguish- 
able from normal samples, with modest variations (specifically 
patient #620, modest variation which can account for the higher 
percentage of regions changed compared with other samples). 
In patient #873, large-scale, catastrophic changes in accessibil- 
ity were observed (Fig. 3C). When the normal tissue from five 
patients were examined, the plots were indistinguishable from 
each other, with scarce variation, confirming the sensitivity of 
this assay (Fig. 3D). 

Discussion 

In the present study, we analyzed the nucleosome distribution at 
the transcription start sites of 886 genes and genome-wide chroma- 
tin accessibility of tumors from LAC patients with matched nor- 
mal tissue. Surprisingly, we found: (1) a majority of genes studied 
showed significant nucleosome distribution changes; (2) nucleo- 
some distribution changes are limited to low-grade tumors and are 
not observed in high-grade tumors; (3) nucleosome distribution 
changes are consistent between different patients and (4) global 
chromatin accessibility profiles show catastrophic changes in 
aggressive tumors. Our results argue strongly for the importance 
of chromatin structure in the progression of cancer. 

We present the first identification of nucleosome distribution 
changes between tumor and matched normal primary patient tis- 
sue in LAC. We found that of the 886 genes studied, we were 
able to detect changes in the majority of genes. Furthermore, we 
found that nucleosome distribution changes in the majority of 
genes occur in low-grade tumors, while no changes were evident 
in high-grade tumors. In fact, nucleosome distribution changes 
appear to potentiate early transformation events, due to the find- 
ing that changes were observed only in low-grade tumors. 

The mechanism by which nucleosome distribution changes 
in low-grade LAC tumors contribute to or permit the progres- 
sion to more aggressive tumor is unclear, but our findings are 
consistent with a critical role for chromatin structural alterations 
as an early transformation event. Importantly, nucleosome dis- 
tribution changes in low-grade tumors are concordant between 
patients, pointing to a shared regulatory mechanism that drives 
the changes. One attractive explanation involves ATP-dependent 
chromatin remodelers, which are responsible for altering the posi- 
tions and density of nucleosomes with respect to the underlying 



DNA sequence. Recent data from exome sequencing in matched 
normal and tumor patient samples have shown chromatin remod- 
elers to be frequently mutated in multiple cancers. 23 ' 24 In addition, 
subunits of the chromatin remodeler SWI/SNF complex can act 
as tumor suppressors in cancer; specifically, loss of BRG1 and 
BRM expression has been reported as an indicator of poor prog- 
nosis in non-small-cell lung cancer. 25 ' 26 Our data suggest that the 
observed nucleosome distribution changes are an early, regulated 
transformation event governed by chromatin remodelers. 

The genomic loci showing measurable differences in nucleo- 
some distribution in the low-grade samples are enriched for genes 
involved in the PI3K pathway. Activation of this signaling net- 
work commonly occurs in human tumors through mutation of 
the components involved in the pathway and has been specifically 
implicated in LAC. 13,27 ' 28 The pathway is activated by growth 
factors, which activate RAS through receptor tyrosine kinases, 
leading to stimulation of the PI3K pathway and other pathways 
involved in the control of cellular growth, proliferation and sur- 
vival. Studies have successfully used PI3K-inhibitory drugs in 
transgenic mouse models to block the growth of lung tumors. 29 
In addition, clinical trials involving small molecular inhibitors 
of PI3K have recently begun, 27 indicating the importance of this 
pathway as a potential cancer therapy. If changes in nucleosome 
distribution act as an early indicator of impending gene-expres- 
sion changes, then our nucleosome-distribution measurements 
could act as predictive indicators of early transformation events 
in this key oncoregulatory pathway. 

By contrast to nucleosome distribution experiments, global 
chromatin accessibility experiments for the same patients do not 
show the same trend. In fact, low-grade tumors did not differ 
from normal tissues in chromatin accessibility, except in a high- 
grade, high-stage tumor, where extreme chromatin accessibility 
changes across the entire genome were observed. A recent study 
has revealed that in 2-3% of different cancer types, including 
lung cancer, a single cellular crisis event called chromothripsis 
occurs, in which tens to hundreds of chromosome rearrangements 
are acquired. 28 Our measurements of substantial chromosomal 
accessibility alterations in the high-grade, high-stage tumor is 
consistent with a chromothripsis event. Our results might well 
reflect the pathology of chromothripsis and the role it may play 
in cancer progression. 

While screening and diagnostic methods have been identified 
for some cancers (e.g., breast, cervical, prostate), robust indica- 
tors have not been identified for lung cancer. In fact, the long- 
term survival rate for patients with lung cancer remains low, and 
molecular markers that have been identified to date are unsuit- 
able for clinical trials. 30,31 The identification of nucleosome dis- 
tribution and chromatin accessibility profiles provides a new 
set of genotype-independent and gene expression-independent 
measurements with which to classify LAC. In our samples we 
were able to identify nucleosome distribution changes common 
to all grade 1 LAC, as well as patient-specific nucleosome dis- 
tribution changes. Likewise we identified catastrophic chromo- 
somal changes in aggressive LAC. These data have allowed us to 
develop a model in which early-grade lung adenocarcinomas are 
linked to changes in nucleosome distributions, while later-grade 
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Figure 2. Consistency of nucleosome distribution changes for key cancer-related genes between patients. The nucleosome distribution for normal 
tissue (black lines) and tumor tissue (red lines) from four patients with grade 1 tumors (#1357 and #4137) and grade 3 tumors (#873 and #386) for five 
genes implicated in lung adenocarcinoma and in cancer in general: (A) ATM (seen previously in Fig. 1), (B) MYC, (C) RHOC, (D) ITGA4 and (E) MLL5. The 
x-axis represents a 2-kb range of genomic position centered on a transcription start site. The y-axis is the log ratio of nucleosomal to bare genomic sig- 
nal at each probe on the microarray. (F) 168 genes with the highest percentages of nucleosome-distribution changes are shared by grade 1 patients 
and are enriched for the regulation of the PI3K cascade. G1, grade 1. 



cancers are linked to large-scale chromosomal changes (Fig. 4). 
Further analysis of the chromatin structural changes presented in 
this work may provide new explanations of the etiology of early 



and late transformation events. Finally, these chromatin struc- 
tural profiles have the potential to serve as potent objective diag- 
nostic and prognostic tools to be used in the determination of 
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Figure 3. High-grade, high-stage tumors show major changes in chromatin accessibility. (A) A flowchart of the general method used to determine 
chromatin accessibility from primary lung tissue. The samples were identical to those used for the nucleosome distribution experiments. A single 
concentration of MNase from the titration generated previously was selected, and high-molecular-weight and low-molecular-weight fractions (cor- 
responding to inaccessible and accessible regions of chromatin, respectively) were gel purified. Inaccessible fractions were labeled with Cy3 and 
accessible fractions with Cy5. Both were hybridized to a tiling microarray spanning the entire genome with probes spaced every 12.5 kb, and the data 
were plotted in R. (B) Percentage of regions in which normal and tumor tissue for each patient differed according to 1-MB t-test windows spanning 
the entire genome. (C) Chromatin accessibility read-out for seven patients representing different grades and stages are shown. The top plot is the 
high-grade, high-stage tumor sample, which shows massive changes in accessibility across the genome. The remaining six plots show no accessibility 
changes but modest variation at chromosome 5 in patient #620. The x-axis represents genomic position; dotted vertical lines indicate chromosome 
start/stop. The y-axis is the log ratio of inaccessible to accessible signal at each probe on the microarray. The 23 autosomes are plotted; black line, 
normal; red line, tumor. (D) The normal tissues for five patients are plotted simultaneously, showing that differences between normal lung epithelium 
tissues from isogenic patients are very modest. 
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appropriate therapies for patients. This work represents an impor- 
tant first step in the identification of an entirely new class of chro- 
matin structural biomarkers that will pave the way for similar 
detailed studies of the role of chromatin structure across multiple 
cancer types. 

Methods 

Patient tissue. Primary samples from LAC patients with a surgi- 
cally removed tumor, and matched normal tissue were selected 
from the University of Massachusetts Medical School Tissue 
Bank (Table SI). The tumor and normal material was snap 
frozen in liquid nitrogen within 1 h of surgery. Patient samples 
were examined by Dr Stephen Lyle, using hematoxylin and eosin 
staining. Only samples with 80% or more tumor cells were 
included, as assessed by histological examination. Patient samples 
were anonymized, and we received patient history along with the 
samples. 

Tissue processing and nuclei purification. Snap-frozen 
patient biopsy samples were shipped from the tissue bank lab 
overnight on dry ice. Samples arrived as approximately 3 mm 3 
blocks of frozen tissue weighing between 400—600 mg. Samples 
were pulverized in a mortar and pestle in liquid nitrogen. For 
all microarray studies, -90% of the resulting powder was cross- 
linked in 1% formaldehyde in PBS for 10 min at RT, and nuclei 
were extracted in nuclei isolation buffer (0.3 M sucrose, 2 mM 
MgOAc 2 , 1 mM CaCl„ 1% Nonidet P-40, 10 mM HEPES, 
pH 7.8) and dounce-homogenized. 10% of the pulverized tis- 
sue sample was used for isolation of total bare genomic DNA 
for use as reference in nucleosome distribution experiments. All 
samples were flash frozen and stored at -80°C, until all samples 
were ready to be processed in parallel. We were able to isolate 
enough material to run replicates of a selection of our experimen- 
tal assays. 

MNase cleavage, mononucleosomal purification and fluo- 
rescent labeling for nucleosome distribution. Ground tis- 
sue from the primary LAC tumors and matched normal tissue 
were digested with titrated amounts of MNase (Worthington 
Biochemical Corp.) for nucleosome distribution experiments. 
Nucleosome distribution protocols were performed as previ- 
ously described. 32 An MNase titration reaction of 10 U/ml, 5 U/ 
ml, 2.5 U/ml and 1.25 U/ml MNase in MNase cleavage buffer 
(4 mM CaCl 2 , 25 mM KC1, 4 mM MgCl,, 12.5% glycerol, 
50 mM Hepes, pH 7.8) was performed for 5 min at 37°C, and 
stopped with 50 mM EDTA. MNase-digested nuclei were then 
treated with proteinase K to a final concentration of 0.2 |xg/u,l, 
sodium dodecyl sulfate (SDS) to a final concentration of 1%, 
and the cross-links were reversed by overnight incubation at 
65°C. DNA was purified by performing a phenol/chloroform 
extraction. The aforementioned steps were repeated for each con- 
centration of MNase, and each set of samples for nucleosome dis- 
tribution experiments. 

To isolate mononucleosomal DNA for the nucleosome distri- 
bution DNA microarrays (test sample), the nucleosomal ladder 
was resolved on a 2% agarose gel, and the mononucleosomal DNA 
band (-150 bp) was excised across all MNase concentrations and 
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purified by electroelution. The mononucleosomal band should 
be cut across several titrations to ensure equal representation of 
the genome. To prepare the total bare genomic DNA from tumor 
and normal samples (reference sample), we isolated genomic 
DNA from the ground tissue by resuspending in a 1% SDS and 
0.2 juLg/ (jlI proteinase K solution, incubated overnight at 55°C 
and phenol/chloroform extracted DNA. 

We used the NimbleGen protocol to label material for all 
microarrays. A fluorescently labeled random 7-mer oligonucle- 
otide primer was used in a Klenow fill-in reaction to label the 
mononucleosomally protected and bare genomic reference DNA 
samples. The mononucleosomally protected DNA sample was 
labeled with a fluorescently labeled Cy3 primer, and reference 
total bare genomic DNA was labeled with a fluorescently labeled 
Cy5 primer. One microgram of sample was used for each labeling 
reaction. The labeling materials, labeling procedure and hybrid- 
ization protocols can be obtained from the array manufacturer. 

Microarray design. We used the NimbleGen platform for 
all microarray experiments. For nucleosome distribution experi- 
ments, we utilized two custom designed high-resolution DNA 
microarrays. Each of these microarrays cover 2,000 bp flank- 
ing the TSSs of either (1) 414 cancer-related genes or (2) 472 
inflammation- and immunity-related genes, selected based 
on publications on their functional relevance to cancer or the 
innate immune response (Table S2 and S3). The TSS sequences 
were repeat masked, so only unique probes were printed on the 
microarray. Both forward and reverse strands were tiled on the 
microarray to provide experimental redundancy and account for 
strand-specific effects. The 60-mer oligonucleotide probes were 
tiled with an average 13 bp spacing (47 bp overlap). Each of the 
12 subarrays on the microarray contained approximately 135,000 
60-mer unique and isothermal oligonucleotide probes. We used 
NimbleGen HD2 design (2.1 million features per microarray), 
12-plex (12 individual isolated experiments per microarray) cus- 
tom-designed and commercially available microarrays, including 



Figure 4. An initial model for chromatin structural patterns of cancer 
progression. We have developed an initial model for tumor progres- 
sion in lung adenocarcinoma and potentially cancer in general. In this 
model, well-differentiated tumors are characterized by many changes 
in nucleosome distribution at key genes involved in oncogenic path- 
ways. Additionally, chromatin accessibility changes are characteristic of 
an aggressive tumor type. Chromatin structure-based classification of 
tumors may be critical in determining appropriate, targeted therapy. 
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built in replicates for controls. The 12-plex format allowed for 
135,000 oligonucleotide probes to be queried in each experiment. 

Microarray data, statistical analysis and computational 
model. We have developed several computational tools for the 
analysis of microarray data. Our lab developed software specifi- 
cally designed for the analysis of nucleosome distribution micro- 
array data. This software, drawGff, runs in the R environment 
for statistical computing and graphical output. Our software 
contains a variety of statistical, graphical plotting and compara- 
tive analysis tools designed to identify important features within 
and across microarray data sets. Specifically, the output of the 
microarray was a log 2 ratio of the Cy3 and Cy5 inputs. Data sets 
were normalized by standard score normalization, and smooth- 
ing was accomplished by LOESS. Scores were computed on non- 
overlapping consecutive windows, and probes were assigned to 



windows based on the probe start coordinate. The t-test mean at 
each TSS (nucleosome distribution) was calculated to determine 
changes between tumor and matched normal. 

Our software will continue to be developed and is freely 
available to the public, including tutorial and help pages 
(http://chromatin.bio.fsu.edu). All data sets and additional 
supporting analyses presented in the results are available at 
(http : / / chromatin.bio.fsu.edu/Drulineretal_201 3a) . 
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